Abstract

To provide adequate multivariate measures of information flow between neural
structures, modified expressions of Partial Directed Coherence (PDC) and Directed
Transfer Function (DTF), two popular multivariate connectivity measures employed
in neuroscience, are introduced and their formal relationship to mutual information
rates is proved.

1 Introduction 

Over the last decade Neuroscience has been witnessing an important paradigm shift thanks 
to the fast advancement of multichannel data acquisition technology. This process has been 
marked by the growing realization that the brain's inner workings can only be grasped 
through a detailed description of how brain areas interact functionally in a scenario that 
has come to be generally referred to as the study of brain connectivity and which stands in
sharp contrast to former longstanding efforts mostly directed at merely identifying which 
brain areas were involved in specific functions. 

As such, many techniques have been proposed to address this problem, especially because
of the need to process and make sense of many simultaneously acquired brain activity
signals (Kaminski and Blinowska, 1991; Sommer and Wichert, 2003; Astolfi et al., 2007).
Among the available methods, we introduced and developed the idea of partial directed
coherence (PDC) (Baccala and Sameshima, 2001b,a) which consists of a means of dissecting
the frequency domain relationship between pairs of signals from among a set of $K > 2$
simultaneously observed time series.

The main characteristic of PDC is that it decomposes the interaction of each pair
of time series within the set into directional components while deducting the possibly
shrouding effect of the remaining $K - 2$ series. It has, for instance, been possible to show
that PDC is related to the notion of Granger causality which corresponds to the ability
of pinpointing the level of attainable improvement in predicting a time series $x_i(n)$ when
the past of another time series $x_j(n)$ is known ($i \neq j$) (Granger, 1969).
In fact, multivariate Granger causality tests as described in Lütkepohl (1993) map
directly onto statistical tests for PDC nullity. Like Granger causality, and as opposed to
ordinary coherence (Priestley, 1981), PDC is a directional quantity; this fact led to the
idea of 'directed' connectivity that allows one to expressly test for the presence of feedback
and to the idea that PDC is somehow associated with the direction of information flow.
The appeal of associating PDC with information flow has been strong; we have used it
ourselves (Baccala and Sameshima, 2001b,a). Yet this suggestion has until now remained
vague and to some extent almost apocryphal. The aim of this paper is to correct this state
of affairs by making the relationship between PDC and information flow at once formally
explicit and precise.

On a par with PDC is the no less important notion of directed transfer function
(DTF) by Kaminski and Blinowska (1991), whose information theoretic interpretation is
also addressed here.

By providing further details and full proofs, this paper expands on our previous publication
(Takahashi et al., 2010) and is organized as follows: in Sec. 2 we provide some
explicit information theoretic background leaving the main result to Sec. 3 followed by
illustrations and comments in Sec. 4 and 5 respectively. Detailed proofs are covered in
the Appendix.



2 BACKGROUND 



The relationship between two discrete time stochastic processes $x = \{x(k)\}_{k\in\mathbb{Z}}$ and $y =
\{y(k)\}_{k\in\mathbb{Z}}$ is assessed via their mutual information rate $\mathrm{MIR}(x, y)$ by comparing their joint
probability density with the product of their marginals:

\[
\mathrm{MIR}(x,y) = \lim_{m\to\infty} \frac{1}{m+1}\, E\left[ \log \frac{dP(x(1),\dots,x(m),y(1),\dots,y(m))}{dP(x(1),\dots,x(m))\, dP(y(1),\dots,y(m))} \right], \tag{1}
\]

where $E[\cdot]$ is the expectation with respect to the joint measure of $x$ and $y$ and where
$dP$ denotes the appropriate probability density. An immediate consequence of (1) is that
independence between $x$ and $y$ implies MIR nullity.

The main classic result for jointly Gaussian stationary processes, due to Gelfand and Yaglom
(1959), relates (1) to the coherence between the processes via

\[
\mathrm{MIR}(x,y) = -\frac{1}{4\pi} \int_{-\pi}^{\pi} \log\left(1 - |C_{xy}(\omega)|^2\right) d\omega, \tag{2}
\]

where the coherence in (2) is given by

\[
C_{xy}(\omega) = \frac{S_{xy}(\omega)}{\sqrt{S_{xx}(\omega)\, S_{yy}(\omega)}}, \tag{3}
\]

with $S_{xx}(\omega)$ and $S_{yy}(\omega)$ standing for the autospectra and $S_{xy}(\omega)$ for the cross-spectrum,
respectively.

The important consequence of this result is that the integrand in (2) may be interpreted
as the frequency decomposition of $\mathrm{MIR}(x, y)$.
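As a hedged numerical illustration of (2), the sketch below approximates the integral with a midpoint rule for a user-supplied coherence function. The name `mir_from_coherence`, the `n_grid` parameter and the constant-coherence toy case are assumptions introduced here for illustration; they do not come from the original text.

```python
import math

def mir_from_coherence(coherence, n_grid=4096):
    """Approximate eq. (2): -(1/(4*pi)) * integral over [-pi, pi) of log(1 - |C(w)|^2) dw.

    `coherence` is a hypothetical callable returning C_xy(omega), real or complex.
    A midpoint rule on `n_grid` equal cells is an illustrative choice only.
    """
    step = 2.0 * math.pi / n_grid
    total = 0.0
    for k in range(n_grid):
        omega = -math.pi + (k + 0.5) * step
        total += math.log(1.0 - abs(coherence(omega)) ** 2) * step
    return -total / (4.0 * math.pi)

# Assumed toy case: squared coherence equal to 0.5 at every frequency.
# The integrand is then flat, so eq. (2) reduces to -0.5 * log(1 - 0.5).
mir_flat = mir_from_coherence(lambda omega: math.sqrt(0.5))
```

Because the integrand is evaluated frequency by frequency, the same loop can be used to inspect which frequency bands contribute most to the rate.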

In view of this result, the following questions arise: Does a similar result hold for PDC? 
How and in what sense? 

Before addressing these problems, consider the zero mean wide sense stationary vector
process $\mathbf{x}(n) = [x_1(n) \dots x_K(n)]^T$ representable by the multivariate autoregressive model

\[
\mathbf{x}(n) = \sum_{l=1}^{+\infty} \mathbf{A}(l)\, \mathbf{x}(n-l) + \mathbf{w}(n), \tag{4}
\]

where $\mathbf{w}(n) = [w_1(n) \dots w_K(n)]^T$ stands for zero mean wide sense stationary innovation
processes with positive definite covariance matrix $\Sigma_w = E[\mathbf{w}(n)\mathbf{w}^T(n)]$.

A sufficient condition for the existence of representation (4) is that the spectral density
matrix associated with the process $\{\mathbf{x}(n)\}_{n\in\mathbb{Z}}$ be uniformly bounded from below and above
and be invertible at all frequencies (Hannan, 1970). From the coefficients $a_{ij}(l)$ of $\mathbf{A}(l)$
we may write

\[
\bar{A}_{ij}(\omega) =
\begin{cases}
1 - \sum_{l=1}^{+\infty} a_{ij}(l)\, e^{-\mathbf{j}\omega l}, & \text{if } i = j, \\
-\sum_{l=1}^{+\infty} a_{ij}(l)\, e^{-\mathbf{j}\omega l}, & \text{otherwise,}
\end{cases} \tag{5}
\]

where $\mathbf{j} = \sqrt{-1}$ for $\omega \in [-\pi, \pi)$.

Also let $\bar{\mathbf{a}}_j(\omega) = [\bar{A}_{1j}(\omega) \dots \bar{A}_{Kj}(\omega)]^T$ and consider the quantity, henceforth termed
information PDC (ιPDC) from $j$ to $i$,

\[
\iota\pi_{ij}(\omega) = \frac{\sigma_{ii}^{-1/2}\, \bar{A}_{ij}(\omega)}{\sqrt{\bar{\mathbf{a}}_j^H(\omega)\, \Sigma_w^{-1}\, \bar{\mathbf{a}}_j(\omega)}}, \tag{6}
\]

where $\sigma_{ii} = E[w_i^2(n)]$ and which simplifies to the originally defined PDC when $\Sigma_w$ equals
the identity matrix. Note also that the generalized PDC (gPDC) from Baccala et al.
(2007) is obtained if $\Sigma_w$ is a diagonal matrix whose elements are not necessarily the same.
Before stating the main result, note that to our knowledge, mention of (6), as in
Baccala et al. (2006), has not appeared in the literature with any explicit association with
information theoretic ideas.
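The definition in (6) can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the autoregressive sum in (5) is truncated to the finitely many lags supplied, the helper names `solve`, `a_bar` and `ipdc` are hypothetical, and the numerical values of the toy model are chosen arbitrarily.

```python
import cmath
import math

def solve(matrix, rhs):
    """Solve matrix @ y = rhs by Gauss-Jordan elimination with partial pivoting."""
    n = len(rhs)
    aug = [[complex(v) for v in matrix[r]] + [complex(rhs[r])] for r in range(n)]
    for c in range(n):
        pivot = max(range(c, n), key=lambda r: abs(aug[r][c]))
        aug[c], aug[pivot] = aug[pivot], aug[c]
        for r in range(n):
            if r != c:
                factor = aug[r][c] / aug[c][c]
                aug[r] = [u - factor * v for u, v in zip(aug[r], aug[c])]
    return [aug[r][n] / aug[r][r] for r in range(n)]

def a_bar(coeffs, omega):
    """Eq. (5): I - sum_l A(l) exp(-j*omega*l); coeffs[l-1] holds the K x K matrix A(l)."""
    K = len(coeffs[0])
    return [[(1.0 if i == j else 0.0)
             - sum(A[i][j] * cmath.exp(-1j * omega * (l + 1)) for l, A in enumerate(coeffs))
             for j in range(K)] for i in range(K)]

def ipdc(coeffs, sigma_w, omega):
    """Eq. (6): entry [i][j] is the information PDC from series j to series i."""
    Ab = a_bar(coeffs, omega)
    K = len(Ab)
    out = [[0j] * K for _ in range(K)]
    for j in range(K):
        col = [Ab[i][j] for i in range(K)]
        weighted = solve(sigma_w, col)  # Sigma_w^{-1} times the j-th column of A-bar
        denom = math.sqrt(sum(a.conjugate() * b for a, b in zip(col, weighted)).real)
        for i in range(K):
            out[i][j] = Ab[i][j] / (math.sqrt(sigma_w[i][i]) * denom)
    return out

# Assumed toy model: x2(n) = 0.5 * x1(n-1) + w2(n), with Sigma_w equal to the identity.
example = ipdc([[[0.0, 0.0], [0.5, 0.0]]], [[1.0, 0.0], [0.0, 1.0]], omega=0.3)
```

With a diagonal `sigma_w` the squared moduli in each column add up to one, which is the gPDC normalization mentioned in the text; with a full `sigma_w` this need not hold.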



3 RESULTS 

3.1 PDC 

Theorem 1. Let the K-variate wide sense stationary time series $\mathbf{x}(n) = [x_1(n) \dots x_K(n)]^T$
satisfy (4), then

\[
\iota\pi_{ij}(\omega) = C_{w_i \eta_j}(\omega), \tag{7}
\]

where $\eta_j(n) = x_j(n) - E[x_j(n) \mid \{x_l(m),\ l \neq j,\ m \in \mathbb{Z}\}]$ which is known as the partialized
process associated to $x_j$ given the remaining time series.

Corollary 1. Let the K-variate Gaussian stationary time series $\mathbf{x}(n) = [x_1(n) \dots x_K(n)]^T$
satisfy (4), then

\[
\mathrm{MIR}(w_i, \eta_j) = -\frac{1}{4\pi} \int_{-\pi}^{\pi} \log\left(1 - |\iota\pi_{ij}(\omega)|^2\right) d\omega. \tag{8}
\]

To obtain the process $\eta_k$, remember that it constitutes the residue of the projection
of $x_k$ onto the past, the future and the present of the remaining processes. Hence its
autospectrum is given by

\[
S_{\eta_k \eta_k}(\omega) = S_{x_k x_k}(\omega) - \mathbf{s}_{x_k \mathbf{x}^k}(\omega)\, \mathbf{S}^{-1}_{\mathbf{x}^k \mathbf{x}^k}(\omega)\, \mathbf{s}_{\mathbf{x}^k x_k}(\omega), \tag{9}
\]

for $\mathbf{x}^k = [x_{l_1} \dots x_{l_{K-1}}]^T$, $\{l_1, \dots, l_{K-1}\} = \{1, \dots, K\} \setminus \{k\}$, where $\mathbf{s}_{x_k \mathbf{x}^k}(\omega)$ is the $(K-1)$-dimensional
vector whose entries are the cross spectra between $x_k$ and the remaining $K - 1$
processes, whereas $\mathbf{S}_{\mathbf{x}^k \mathbf{x}^k}(\omega)$ is the spectral density matrix of $\mathbf{x}^k$. The spectrum $S_{\eta_k \eta_k}(\omega)$
is also known in the literature as the partial spectrum of $x_k$ given $\mathbf{x}^k$ (Priestley, 1981).
Note that

\[
\mathbf{g}_k(\omega) = \mathbf{s}_{x_k \mathbf{x}^k}(\omega)\, \mathbf{S}^{-1}_{\mathbf{x}^k \mathbf{x}^k}(\omega) \tag{10}
\]

constitutes an optimum Wiener filter whose role in producing $\eta_k$ is to deduct the influence
of the other variables from $x_k$ to single out that contribution that is only its own.

Theorem 1 shows that PDC from $x_j$ to $x_i$ measures the amount of information common
to the $\eta_j$ partial process and the $w_i$ innovation. The proof is left to the Appendix but its
main idea is to prove (7) so that (8) follows by use of (2) to produce $\mathrm{MIR}(w_i, \eta_j)$.

3.2 DTF 

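Every stationary process $\{\mathbf{x}(n)\}_{n\in\mathbb{Z}}$ with autoregressive representation (4) also has the
following moving average representation

\[
\mathbf{x}(n) = \sum_{l=0}^{+\infty} \mathbf{H}(l)\, \mathbf{w}(n-l), \tag{11}
\]

where the innovation process $\mathbf{w}$ is the same as that of (4).

In connection to the $h_{ij}(l)$ coefficients of $\mathbf{H}(l)$, consider the matrix $\bar{\mathbf{H}}(\omega)$ with entries

\[
\bar{H}_{ij}(\omega) = \sum_{l=0}^{+\infty} h_{ij}(l)\, e^{-\mathbf{j}\omega l}, \tag{12}
\]

and let $\bar{\mathbf{h}}_j(\omega) = [\bar{H}_{j1}(\omega) \dots \bar{H}_{jK}(\omega)]^T$ whence follows the definition of information
directed transfer function (ιDTF) from $j$ to $i$ as

\[
\iota\gamma_{ij}(\omega) = \frac{\rho_{jj}^{1/2}\, \bar{H}_{ij}(\omega)}{\sqrt{\bar{\mathbf{h}}_i^H(\omega)\, \Sigma_w\, \bar{\mathbf{h}}_i(\omega)}}, \tag{13}
\]

where $\rho_{jj}$ is the variance of the partialized innovation process $\zeta_j(n) = w_j(n) - E[w_j(n) \mid \{w_l(n),\ l \neq j\}]$
given explicitly by

\[
\rho_{jj} = \sigma_{jj} - \boldsymbol{\sigma}_{j\cdot}^T\, \Sigma_{\mathbf{w}^j}^{-1}\, \boldsymbol{\sigma}_{j\cdot},
\]

where $\boldsymbol{\sigma}_{j\cdot}$ is the vector of covariances between $w_j(n)$ and the innovations
$\mathbf{w}^j(n) = [w_{l_1}(n) \dots w_{l_{K-1}}(n)]^T$ where $\{l_1, \dots, l_{K-1}\} = \{1, \dots, K\} \setminus \{j\}$ and $\Sigma_{\mathbf{w}^j}$ is the
covariance matrix of $\mathbf{w}^j(n)$.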



When $\Sigma_w$ is the identity matrix, (13) reduces to the original DTF from Kaminski and Blinowska
(1991). Also when $\Sigma_w$ is a diagonal matrix with distinct elements (13) reduces to directed
coherence as defined in Baccala et al. (1999).
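A minimal sketch of (13) follows, assuming a moving average representation truncated to finitely many lags. The helper names `solve`, `partial_innovation_variance` and `idtf` are hypothetical, and the three-variable coefficients used at the end are arbitrary values picked for illustration.

```python
import cmath
import math

def solve(matrix, rhs):
    """Solve matrix @ y = rhs by Gauss-Jordan elimination with partial pivoting."""
    n = len(rhs)
    aug = [[complex(v) for v in matrix[r]] + [complex(rhs[r])] for r in range(n)]
    for c in range(n):
        pivot = max(range(c, n), key=lambda r: abs(aug[r][c]))
        aug[c], aug[pivot] = aug[pivot], aug[c]
        for r in range(n):
            if r != c:
                factor = aug[r][c] / aug[c][c]
                aug[r] = [u - factor * v for u, v in zip(aug[r], aug[c])]
    return [aug[r][n] / aug[r][r] for r in range(n)]

def partial_innovation_variance(sigma_w, j):
    """rho_jj: variance of w_j after removing its projection on the other innovations."""
    others = [l for l in range(len(sigma_w)) if l != j]
    if not others:
        return float(sigma_w[j][j])
    sub = [[sigma_w[a][b] for b in others] for a in others]
    cross = [sigma_w[a][j] for a in others]
    weights = solve(sub, cross)
    return sigma_w[j][j] - sum(c * w for c, w in zip(cross, weights)).real

def idtf(ma_coeffs, sigma_w, omega):
    """Eq. (13): entry [i][j] is the information DTF from series j to series i.

    ma_coeffs[l] holds the K x K matrix H(l) for l = 0, 1, ... (truncated sum, an assumption).
    """
    K = len(sigma_w)
    Hb = [[sum(H[i][j] * cmath.exp(-1j * omega * l) for l, H in enumerate(ma_coeffs))
           for j in range(K)] for i in range(K)]
    rho = [partial_innovation_variance(sigma_w, j) for j in range(K)]
    out = [[0j] * K for _ in range(K)]
    for i in range(K):
        # Autospectrum of x_i, i.e. the quadratic form in the i-th row of H-bar.
        s_xx = sum(Hb[i][k] * sigma_w[k][m] * Hb[i][m].conjugate()
                   for k in range(K) for m in range(K)).real
        for j in range(K):
            out[i][j] = math.sqrt(rho[j]) * Hb[i][j] / math.sqrt(s_xx)
    return out

# Assumed toy chain x1 -> x2 -> x3 with gains alpha and beta and unit innovations.
alpha, beta = 0.5, 0.8
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
H1 = [[0.0, 0.0, 0.0], [alpha, 0.0, 0.0], [0.0, beta, 0.0]]
H2 = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0], [alpha * beta, 0.0, 0.0]]
chain = idtf([identity, H1, H2], identity, omega=0.7)
```

In this toy chain the entry `chain[2][0]` is nonzero even though no coefficient links the first series directly to the third, which is the "net effect" behaviour of DTF.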



For this new quantity, a result analogous to Theorem 1 holds.



Theorem 2. Let the K-variate wide sense stationary time series $\mathbf{x}(n) = [x_1(n) \dots x_K(n)]^T$
satisfy (11), then

\[
\iota\gamma_{ij}(\omega) = C_{x_i \zeta_j}(\omega), \tag{14}
\]

where $\zeta_j$ is the previously defined partialized innovation process.

Corollary 2. Let the K-variate Gaussian stationary time series $\mathbf{x}(n) = [x_1(n) \dots x_K(n)]^T$
satisfy (11), then

\[
\mathrm{MIR}(x_i, \zeta_j) = -\frac{1}{4\pi} \int_{-\pi}^{\pi} \log\left(1 - |\iota\gamma_{ij}(\omega)|^2\right) d\omega. \tag{15}
\]

An important remark is that (7) and (14) hold for wide-sense stationary processes with
autoregressive and moving average representations, respectively, and that the Gaussianity
requirement is unnecessary for their validity.

Also the integrands in (8) and (15) are readily interpretable as mutual information
rates at each frequency.



4 ILLUSTRATIVE EXAMPLE 



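Via the following simple accretive example it is possible to explicitly expose the nature of
the results above:

\[
\begin{bmatrix} x_1(n) \\ x_2(n) \end{bmatrix} =
\begin{bmatrix} 0 & 0 \\ \alpha & 0 \end{bmatrix}
\begin{bmatrix} x_1(n-1) \\ x_2(n-1) \end{bmatrix} +
\begin{bmatrix} w_1(n) \\ w_2(n) \end{bmatrix}, \tag{16}
\]

where $E[w_i(n) w_j(m)] = \delta_{nm}\delta_{ij}$, for $m, n \in \mathbb{Z}$ and $i, j \in \{1, 2\}$ with $\delta_{pq}$ standing for the
usual Kronecker delta symbol.

Clearly $\iota\pi_{12}(\omega) = 0$ and

\[
\iota\pi_{21}(\omega) = \frac{-\alpha e^{-\mathbf{j}\omega}}{\sqrt{1 + \alpha^2}}.
\]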



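To obtain $C_{w_1 \eta_2}(\omega)$, one uses the fact that

\[
\mathbf{s}_{21}(\omega)\, \mathbf{S}_{11}^{-1}(\omega) = \alpha e^{-\mathbf{j}\omega}
\]

implies $\eta_2(n) = x_2(n) - \alpha x_1(n-1) = w_2(n)$ so that $C_{w_1 \eta_2}(\omega) = 0$, and hence $\iota\pi_{12}(\omega) = C_{w_1 \eta_2}(\omega)$.
Now to compute $C_{w_2 \eta_1}(\omega)$ one must use the spectral density matrix of $[x_1\ x_2]^T$ given
by

\[
\begin{bmatrix} S_{x_1 x_1}(\omega) & S_{x_1 x_2}(\omega) \\ S_{x_2 x_1}(\omega) & S_{x_2 x_2}(\omega) \end{bmatrix} =
\begin{bmatrix} 1 & \alpha e^{\mathbf{j}\omega} \\ \alpha e^{-\mathbf{j}\omega} & 1 + \alpha^2 \end{bmatrix}
\]

leading to the optimum filter

\[
G_1(\omega) = \mathbf{s}_{12}(\omega)\, \mathbf{S}_{22}^{-1}(\omega) = \frac{\alpha}{1 + \alpha^2}\, e^{\mathbf{j}\omega}
\]

for $E[x_1(n) \mid \{x_2(m),\ m \in \mathbb{Z}\}]$. It is noncausal and produces $\frac{\alpha}{1+\alpha^2}\, x_2(n+1)$ so that

\[
\eta_1(n) = x_1(n) - \frac{\alpha}{1 + \alpha^2}\, x_2(n+1).
\]

Since $x_1(n) = w_1(n)$ and $x_2(n) = \alpha w_1(n-1) + w_2(n)$,

\[
\eta_1(n) = \frac{1}{1 + \alpha^2}\, w_1(n) - \frac{\alpha}{1 + \alpha^2}\, w_2(n+1),
\]

which leads to

\[
S_{w_2 \eta_1}(\omega) = \frac{-\alpha e^{-\mathbf{j}\omega}}{1 + \alpha^2}, \qquad
S_{\eta_1 \eta_1}(\omega) = \frac{1}{1 + \alpha^2}
\]

and

\[
S_{w_2 w_2}(\omega) = 1,
\]

showing that

\[
C_{w_2 \eta_1}(\omega) = \frac{-\alpha e^{-\mathbf{j}\omega}}{\sqrt{1 + \alpha^2}}
\]

confirms that $\iota\pi_{21}(\omega) = C_{w_2 \eta_1}(\omega)$ via direct computation of the Fourier transforms of the
covariance/cross-covariance functions involving $w_2$ and $\eta_1$.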
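The bivariate computation above can be replayed numerically. The sketch below is illustrative only: the function name `check_bivariate_example` and the sample values of the coupling gain and frequency are assumptions, and the spectral density matrix is the one stated in the text for model (16) with unit-variance innovations.

```python
import cmath
import math

def check_bivariate_example(alpha, omega):
    """Return (iPDC_21 from eq. (6), coherence C_{w2 eta1}) for model (16), Sigma_w = I."""
    e = cmath.exp(-1j * omega)
    # Spectral density matrix of [x1 x2]^T as stated in the text.
    s11, s12 = 1.0, alpha * e.conjugate()
    s21, s22 = alpha * e, 1.0 + alpha ** 2
    # Cross-spectra between each x_l and eta1 (x_l minus its Wiener-filtered part).
    s_x1_eta1 = s11 - s12 * s21 / s22   # equals the partial spectrum of x1 given x2, eq. (9)
    s_x2_eta1 = s21 - s22 * s21 / s22   # vanishes: eta1 is orthogonal to x2
    s_eta1 = s_x1_eta1.real
    # Second row of A-bar from eq. (5): w2 = A21 * x1 + A22 * x2 in the frequency domain.
    a21, a22 = -alpha * e, 1.0
    s_w2_eta1 = a21 * s_x1_eta1 + a22 * s_x2_eta1
    coherence = s_w2_eta1 / math.sqrt(1.0 * s_eta1)   # S_{w2 w2} = 1
    ipdc_21 = a21 / math.sqrt(1.0 + abs(a21) ** 2)    # eq. (6) with Sigma_w = I
    return ipdc_21, coherence

# Assumed sample values for the coupling gain and the frequency.
ipdc_value, coherence_value = check_bivariate_example(alpha=0.7, omega=1.1)
```

Since the squared modulus here is the constant $\alpha^2/(1+\alpha^2)$, inserting it into (8) under the Gaussian assumption of Corollary 1 gives $\mathrm{MIR}(w_2, \eta_1) = \tfrac{1}{2}\log(1+\alpha^2)$ with natural logarithms.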

It is easy to verify that $\zeta_i(n) = w_i(n)$, so that direct computations also confirm ιPDC
and ιDTF equality in the $K = 2$ case (Baccala and Sameshima, 2001a) when $\Sigma_w$ is the
identity matrix.



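Let model (16) be enlarged by including a third observed variable

\[
x_3(n) = \beta x_2(n-1) + w_3(n), \tag{17}
\]

where $w_3(n)$ is zero mean unit variance Gaussian and orthogonal to $w_1(n)$ and $w_2(n)$ for
all lags. This new equation means that the signal $x_1$ has an indirect path to $x_3$ via $x_2$ but
no direct means of reaching $x_3$.

For this augmented model, the following joint moving average representation holds

\[
\begin{bmatrix} x_1(n) \\ x_2(n) \\ x_3(n) \end{bmatrix} =
\begin{bmatrix} w_1(n) \\ w_2(n) \\ w_3(n) \end{bmatrix} +
\begin{bmatrix} 0 & 0 & 0 \\ \alpha & 0 & 0 \\ 0 & \beta & 0 \end{bmatrix}
\begin{bmatrix} w_1(n-1) \\ w_2(n-1) \\ w_3(n-1) \end{bmatrix} +
\begin{bmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ \alpha\beta & 0 & 0 \end{bmatrix}
\begin{bmatrix} w_1(n-2) \\ w_2(n-2) \\ w_3(n-2) \end{bmatrix},
\]

which produces

\[
\begin{aligned}
\iota\gamma_{21}(\omega) &= \frac{\alpha e^{-\mathbf{j}\omega}}{\sqrt{1 + \alpha^2}}, \\
\iota\gamma_{32}(\omega) &= \frac{\beta e^{-\mathbf{j}\omega}}{\sqrt{1 + \beta^2 + \alpha^2\beta^2}}, \\
\iota\gamma_{31}(\omega) &= \frac{\alpha\beta e^{-2\mathbf{j}\omega}}{\sqrt{1 + \beta^2 + \alpha^2\beta^2}},
\end{aligned} \tag{18}
\]

and $\iota\gamma_{kl}(\omega) = 0$ for $l > k$ by direct computation using (13). To verify (14), one obtains
$\zeta_i = w_i$ since the $w_i$ innovations are uncorrelated leading to

\[
\begin{gathered}
S_{x_2 \zeta_1}(\omega) = \alpha e^{-\mathbf{j}\omega}, \quad S_{x_3 \zeta_2}(\omega) = \beta e^{-\mathbf{j}\omega}, \\
S_{x_3 \zeta_1}(\omega) = \alpha\beta e^{-2\mathbf{j}\omega}, \quad S_{x_2 x_2}(\omega) = 1 + \alpha^2, \\
S_{x_3 x_3}(\omega) = 1 + \beta^2 + \alpha^2\beta^2, \quad S_{\zeta_1 \zeta_1}(\omega) = 1 = S_{\zeta_2 \zeta_2}(\omega),
\end{gathered}
\]

wherefrom $\iota\gamma_{21}(\omega) = C_{x_2 \zeta_1}(\omega)$, $\iota\gamma_{32}(\omega) = C_{x_3 \zeta_2}(\omega)$ and $\iota\gamma_{31}(\omega) = C_{x_3 \zeta_1}(\omega)$ using (3).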
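One may compute this model's PDCs

\[
\iota\pi_{21}(\omega) = \frac{-\alpha e^{-\mathbf{j}\omega}}{\sqrt{1 + \alpha^2}}, \qquad
\iota\pi_{32}(\omega) = \frac{-\beta e^{-\mathbf{j}\omega}}{\sqrt{1 + \beta^2}}, \qquad
\iota\pi_{31}(\omega) = 0,
\]

either via (6), or via Theorem 1.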

This exposes the fact that the augmented model's direct interaction is represented by
PDC, whereas DTF from $x_1$ to $x_3$ in (18) is zero only if either $\alpha$ or $\beta$ is zero. This means that
DTF is nonzero whenever a signal pathway leaving $x_1$ reaches $x_3$, so that DTF represents the net directed
effect of $x_1$ onto $x_3$ as in fact previously noted in Baccala and Sameshima (2001b).



5 Discussion 

In their information forms both PDC and DTF represent true coherences and thus
constitute complete alternative descriptions of the dynamic relations involving the observed
vector time series $\mathbf{x}(n)$ and $\mathbf{w}(n)$, the innovations vector process, or its orthogonalized
version $\boldsymbol{\zeta}(n)$, which summarize the stochastic novelty after stripping all mutual correlations
present in $\mathbf{x}(n)$.

DTF can be thought of as a forward description for it depicts how $\boldsymbol{\zeta}(n)$ affects the
$\mathbf{x}(n)$ observations, whereas PDC describes how $\mathbf{w}(n)$ relates to $\boldsymbol{\eta}(n)$ which is obtained
by the mutual partialization of the components of $\mathbf{x}(n)$. Thus, $\eta_i(n)$ essentially excludes
those redundancies in $x_i(n)$ that can be attributed to the other $x_j(n)$ ($j \neq i$). This
redundancy extraction, by direct analogy with linear algebraic procedures, gives rise to
$\{\eta_i, i = 1, \dots, K\}$ as a dual (also frequently termed reciprocal) basis to the $\{x_i, i =
1, \dots, K\}$ basis as its elements $\eta_i$ are orthogonal to all $x_j$ for $i \neq j$. It is in this precise
sense that PDC's description is dual to DTF's: they map the innovations onto dual
representations of the observed dynamics.

A question that may come to mind is: how can DTF (PDC), being related to mutual
information, a recognizedly reciprocal quantity, describe unreciprocal aspects
of the interaction between time series? The answer lies in that they relate the $x_i$ ($\eta_i$) to
innovations $\zeta_j$ ($w_j$) so that permuting $i$ and $j$ describes the relationship between distinct
inner component subprocesses. As such, for example, in the case of PDC, $\mathrm{MIR}(w_i, \eta_j)$ (for
$|\iota\pi_{ij}(\omega)|^2$) and $\mathrm{MIR}(w_j, \eta_i)$ (for $|\iota\pi_{ji}(\omega)|^2$) are not equal in general as opposed to

\[
\mathrm{MIR}(w_i, \eta_j) = \mathrm{MIR}(\eta_j, w_i) \tag{19}
\]

whose equality always holds because $|\iota\pi_{ij}(\omega)|^2 = |\iota\pi^*_{ij}(\omega)|^2$ where $*$ denotes complex
conjugation. In other words, index permutation in PDC entails comparing different underlying
intrinsic component processes. A similar result holds for DTF.

Another point is why PDC/DTF are related to Granger causality. This is so because
the inherent decorrelation
$E[w_i(n) w_j(m)] = 0$ for all $i, j$ provided that $n \neq m$ introduces the necessary time asymmetry
to allow their causal interpretations. Also observe that by definition of innovation, time
asymmetry is an automatic consequence of $w_j(n)$'s lack of correlation with $\eta_j(k)$ for $k < n$. The
same holds for $\zeta_j(n)$ which is uncorrelated with $x_i(k)$ for $k < n$.

Though left to the Appendix, the proof of Theorem 1 reveals an interesting aspect,
namely eq. (26) that allows interpreting $\bar{A}_{ij}(\omega)$ (5) as a transfer function from $\eta_j$ to
$w_i$. This observation sheds light on Schelter et al. (2009)'s employment of a studentized
version of $\bar{A}_{ij}(\omega)$ in characterizing the relationship between $\mathbf{x}(n)$ components. Similar
observations hold for the $\bar{H}_{ij}(\omega)$, whose magnitude has been used by Blinowska et al.
(2010).

PDC and DTF are not alone as attempts to describe information flow between
multivariate time series. To discuss these ideas one must also mention the efforts of
Geweke (1982) and Hosoya (1991). Though a detailed comparison
of their proposals vis-à-vis those described herein is beyond our intended scope and is
planned for future publications, it is perhaps reassuring to note that when just time series pairs are
considered ($K = 2$) all of the latter frequency domain measures coalesce into one and the
same measure.

As a matter of fact, for $K = 2$, it is possible to show that

\[
|\iota\pi_{ij}(\omega)|^2 = |\iota\gamma_{ij}(\omega)|^2 = 1 - e^{-f_{j\to i}(\omega)} = 1 - e^{-M_{j\to i}(\omega)} \tag{20}
\]

for $i, j \in \{1, 2\}$ where $f_{j\to i}(\omega)$ and $M_{j\to i}(\omega)$ describe respectively Geweke's and Hosoya's
frequency domain causal measures in their own notation (the arrow shows the direction
of information flow). Furthermore, when it comes to testing the null hypothesis of no Granger
causality when $K = 2$, it is straightforward to verify the equivalence of the following
statements:

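1. There is no Granger causality from $x_j$ to $x_i$.

2. $\mathrm{MIR}(x_i, \zeta_j) = 0$.

3. $\mathrm{MIR}(w_i, \eta_j) = 0$.

4. $|\iota\pi_{ij}(\omega)|^2 = 0$, $\forall \omega \in [-\pi, \pi)$.

5. $|\iota\gamma_{ij}(\omega)|^2 = 0$, $\forall \omega \in [-\pi, \pi)$.

6. $f_{j\to i}(\omega) = 0$, $\forall \omega \in [-\pi, \pi)$.

7. $M_{j\to i}(\omega) = 0$, $\forall \omega \in [-\pi, \pi)$.

8. $\bar{A}_{ij}(\omega) = 0$, $\forall \omega \in [-\pi, \pi)$.

9. $\bar{H}_{ij}(\omega) = 0$, $\forall \omega \in [-\pi, \pi)$.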
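For the bivariate case, relation (20) gives a one-line conversion between the squared ιPDC (or ιDTF) and Geweke's measure. The sketch below only restates (20); the function names are hypothetical and the sample value corresponds to the two-variable example with an arbitrarily chosen gain of 0.5.

```python
import math

def geweke_from_squared_ipdc(sq_ipdc):
    """Invert eq. (20) for K = 2: f = -log(1 - |iPDC|^2)."""
    return -math.log(1.0 - sq_ipdc)

def squared_ipdc_from_geweke(f_value):
    """Eq. (20) for K = 2: |iPDC|^2 = 1 - exp(-f)."""
    return 1.0 - math.exp(-f_value)

# Assumed sample: bivariate example with gain 0.5, where |iPDC_21|^2 = 0.25 / 1.25.
f_sample = geweke_from_squared_ipdc(0.25 / 1.25)
```

Both maps send zero to zero, which is consistent with the equivalence of statements 4 to 7 in the list above.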

Which of the above statements is more convenient depends on criteria like knowledge
of precise asymptotic statistics and test power. In fact, precise results of this kind for the
general $K > 2$ case, that also include asymptotic confidence intervals, are known for
$|\iota\pi_{ij}(\omega)|^2$ and are being prepared for submission.

Though time domain considerations are strictly outside our scope, they are required
to fully understand the difference between the various measures (Takahashi, 2009) and
underlie the difficulties of generalizing Geweke's and Hosoya's proposals to $K > 2$ as
attempted respectively in Geweke (1984) and Hosoya (2001) while keeping a consistent
interpretation of information flow in association with Granger causality.

A summary of the relationships between the underlying processes addressed in this
paper is portrayed in Figure 1.

Finally, it should be noted that ιPDC, as herein defined, provides an absolute signal
scale invariant measure of direct connectivity strength between observed time series as
opposed to either PDC or gPDC that provide only relative coupling assessments.



6 Conclusion 

New properly weighted multivariate directed dependence measures between stochastic
processes that generalize PDC and DTF have been introduced and their relationship to
mutual information has been spelled out in terms of more fundamental adequately
partialized processes. These results clarify the relationship between formerly available connectivity
measures and the notion of information flow. Theorem 1 is a novel result. For bivariate
time series, results similar to Theorem 2 have appeared several times in the literature
in association with Geweke's measure of directed dependence (Geweke, 1982). The ιDTF
introduced herein is novel and constitutes a proper generalization of Geweke's result for
the multivariate setting while ιPDC's result (also novel) is its dual.

The present results not only introduce a unified framework to understand connectivity
measures, but also open new generalization perspectives in nonlinear interaction cases for
which information theory seems to be the natural study toolset.

[Figure 1 diagram: $\mathbf{w}$ and $\boldsymbol{\zeta}$ linked by partialization; $\mathbf{x}$ and $\boldsymbol{\eta}$ linked by partialization; ιPDC relates $\boldsymbol{\eta}$ and $\mathbf{w}$; ιDTF relates $\boldsymbol{\zeta}$ and $\mathbf{x}$.]

Figure 1: The diagram summarizes the relationship between the various descriptive
processes associated with the original observations $\mathbf{x}$ including the innovations process $\mathbf{w}$ and
their respective partialized versions $\boldsymbol{\eta}$ and $\boldsymbol{\zeta}$. The mutual information rate relationships
are described by ιPDC for $(\boldsymbol{\eta}, \mathbf{w})$ and ιDTF for $(\boldsymbol{\zeta}, \mathbf{x})$. Information flow sources are
indexed by the greek type face vector components and the information receiving structures
are chosen among the components of the latin type face processes.

A Appendix

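A.1 Proof of Theorem 1 and Corollary 1

Before proving Theorem 1 consider the following lemma:

Lemma 1. Let $\mathbf{S}_x(\omega)$ be the power spectral density matrix of the stationary K-variate
time series $\mathbf{x}(n)$ obeying (4). Then

\[
\bar{\mathbf{a}}_j^H(\omega)\, \Sigma_w^{-1}\, \bar{\mathbf{a}}_j(\omega) = S^{-1}_{\eta_j \eta_j}(\omega) \tag{21}
\]

holds.

Proof. One may rewrite (5) in matrix form as

\[
\bar{\mathbf{A}}(\omega) = \mathbf{I} - \sum_{l=1}^{+\infty} \mathbf{A}(l)\, e^{-\mathbf{j}\omega l}, \tag{22}
\]

where $\mathbf{I}$ is the $K \times K$ identity matrix.

Because (4) holds, the inverse of the spectral density matrix of $\mathbf{x}$ may be written as

\[
\mathbf{S}_x^{-1}(\omega) = \bar{\mathbf{A}}^H(\omega)\, \Sigma_w^{-1}\, \bar{\mathbf{A}}(\omega), \tag{23}
\]

each of whose diagonal elements

\[
[\mathbf{S}_x^{-1}(\omega)]_{jj} = \left( S_{x_j x_j}(\omega) - \mathbf{s}_{x_j \mathbf{x}^j}(\omega)\, \mathbf{S}^{-1}_{\mathbf{x}^j \mathbf{x}^j}(\omega)\, \mathbf{s}_{\mathbf{x}^j x_j}(\omega) \right)^{-1}
\]

follows from the partitioned matrix inversion formula (see e.g. the appendix in Lütkepohl
(1993)) which equals

\[
S^{-1}_{\eta_j \eta_j}(\omega) = \bar{\mathbf{a}}_j^H(\omega)\, \Sigma_w^{-1}\, \bar{\mathbf{a}}_j(\omega) \tag{24}
\]

by direct computation of $[\mathbf{S}_x^{-1}(\omega)]_{jj}$ from (23). □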
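Lemma 1 can be checked numerically on the two-variable example. The sketch below assumes model (16) but allows a general positive definite innovation covariance, which is an extension made here purely for illustration; the helper names `inv2`, `matmul2`, `herm2` and `lemma1_sides` are hypothetical.

```python
import cmath

def inv2(m):
    """Inverse of a 2 x 2 matrix given as nested lists."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def matmul2(x, y):
    """Product of two 2 x 2 matrices."""
    return [[sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def herm2(x):
    """Conjugate transpose of a 2 x 2 matrix."""
    return [[complex(x[j][i]).conjugate() for j in range(2)] for i in range(2)]

def lemma1_sides(alpha, sigma_w, omega, j):
    """Return ([S_x^{-1}]_jj, a_j^H Sigma_w^{-1} a_j), the two sides linked by (23)-(24)."""
    h = alpha * cmath.exp(-1j * omega)
    h_bar = [[1.0 + 0j, 0j], [h, 1.0 + 0j]]    # moving average side of model (16)
    a_bar = [[1.0 + 0j, 0j], [-h, 1.0 + 0j]]   # autoregressive side, eq. (22)
    s_x = matmul2(matmul2(h_bar, sigma_w), herm2(h_bar))
    lhs = inv2(s_x)[j][j]                       # reciprocal of the partial spectrum of x_j
    sw_inv = inv2(sigma_w)
    col = [a_bar[0][j], a_bar[1][j]]
    rhs = sum(col[p].conjugate() * sw_inv[p][q] * col[q]
              for p in range(2) for q in range(2))
    return lhs, rhs

# Assumed sample values, including a non-diagonal innovation covariance.
lhs_value, rhs_value = lemma1_sides(0.7, [[2.0, 0.5], [0.5, 1.0]], omega=0.9, j=0)
```

With the identity as innovation covariance and the first series selected, both sides reduce to $1 + \alpha^2$, the reciprocal of the partial spectrum found in the illustrative example.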

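Lemma 1 allows rewriting (6) as

\[
\iota\pi_{ij}(\omega) = \bar{A}_{ij}(\omega)\, \sigma_{ii}^{-1/2} \sqrt{S_{\eta_j \eta_j}(\omega)}.
\]

Hence to prove Theorem 1 all one must show is that

\[
\iota\pi_{ij}(\omega) = \frac{S_{w_i \eta_j}(\omega)}{\sqrt{S_{w_i w_i}(\omega)\, S_{\eta_j \eta_j}(\omega)}}. \tag{25}
\]

Since $S_{w_i w_i}(\omega) = \sigma_{ii}$, proving (25) reduces to showing

\[
\bar{A}_{ij}(\omega) = \frac{S_{w_i \eta_j}(\omega)}{S_{\eta_j \eta_j}(\omega)}. \tag{26}
\]

By straightforward computation with help of (10)

\[
S_{w_i \eta_j}(\omega) = \sum_{l=1}^{K} \bar{A}_{il}(\omega) \left\{ S_{x_l x_j}(\omega) - \mathbf{s}_{x_l \mathbf{x}^j}(\omega)\, \mathbf{S}^{-1}_{\mathbf{x}^j \mathbf{x}^j}(\omega)\, \mathbf{s}_{\mathbf{x}^j x_j}(\omega) \right\},
\]

whose right-hand side can be broken as

\[
\bar{A}_{ij}(\omega)\, S_{\eta_j \eta_j}(\omega) + \sum_{l=1,\ l \neq j}^{K} \bar{A}_{il}(\omega) \left\{ S_{x_l x_j}(\omega) - \mathbf{s}_{x_l \mathbf{x}^j}(\omega)\, \mathbf{S}^{-1}_{\mathbf{x}^j \mathbf{x}^j}(\omega)\, \mathbf{s}_{\mathbf{x}^j x_j}(\omega) \right\}, \tag{27}
\]

which simplifies to

\[
S_{w_i \eta_j}(\omega) = \bar{A}_{ij}(\omega)\, S_{\eta_j \eta_j}(\omega)
\]

as the partialized process $\eta_j$ is orthogonal to $\mathbf{x}^j$ by construction, i.e.

\[
S_{x_l x_j}(\omega) - \mathbf{s}_{x_l \mathbf{x}^j}(\omega)\, \mathbf{S}^{-1}_{\mathbf{x}^j \mathbf{x}^j}(\omega)\, \mathbf{s}_{\mathbf{x}^j x_j}(\omega) = 0, \quad \forall\, l \neq j,\ \omega \in [-\pi, \pi), \tag{28}
\]

thereby concluding the proof.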

Corollary 1 is a direct consequence of applying the following theorem when $w_i$ and $\eta_j$ are
jointly stationary Gaussian.

Theorem 3 (Gelfand and Yaglom, 1959). Let $x$ and $y$ be jointly Gaussian stationary time
series. Assume that

\[
E[w_x(n) w_y(n)]^2 < E[w_x^2(n)]\, E[w_y^2(n)],
\]

where $w_x(n)$ and $w_y(n)$ are the innovations associated to $x$ and $y$. Then the following
equality holds:

\[
\mathrm{MIR}(x, y) = -\frac{1}{4\pi} \int_{-\pi}^{\pi} \log\left(1 - |C_{xy}(\omega)|^2\right) d\omega. \tag{29}
\]

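A.2 Proof of Theorem 2 and Corollary 2

To prove Theorem 2 recall that by definition

\[
C_{x_i \zeta_j}(\omega) = \frac{S_{x_i \zeta_j}(\omega)}{\sqrt{S_{x_i x_i}(\omega)\, S_{\zeta_j \zeta_j}(\omega)}}
\]

and that

\[
S_{\zeta_j \zeta_j}(\omega) = \rho_{jj}.
\]

Therefore, it suffices to show that

\[
\iota\gamma_{ij}(\omega) = \frac{S_{x_i \zeta_j}(\omega)}{\sqrt{\rho_{jj}\, S_{x_i x_i}(\omega)}}. \tag{30}
\]

By the existence of the moving average representation (11)

\[
S_{x_i x_i}(\omega) = \bar{\mathbf{h}}_i^H(\omega)\, \Sigma_w\, \bar{\mathbf{h}}_i(\omega). \tag{31}
\]

Also, by the existence of moving average representation (11) and the orthogonality of
the partialized innovation process $\zeta_j$ with respect to the innovations $w_l$, $l \neq j$, it follows
that

\[
S_{x_i \zeta_j}(\omega) = \bar{H}_{ij}(\omega)\, \rho_{jj}
\]

and this concludes the proof.

Corollary 2 follows immediately from Theorems 2 and 3.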

B ACKNOWLEDGMENTS 

The authors gratefully acknowledge support from the FAPESP/CINAPCE 2005/56464-9
Grant. D.Y.T. to CAPES Grant and FAPESP Grant 2008/08171-0. L.A.B. to CNPq
Grants 306964/2006-6 and 304404/2009-8. K.S. to CNPq Grant

References 

Laura Astolfi, Fabio Cincotti, D. Mattia, M. G. Marciani, Luiz A. Baccala, F. D. V. Fallani, 
S. Salinari, M. Ursino, M. Zavaglia, L. Ding, J. C. Edgar, G. A. Miller, B. He, and 
F. Babiloni. Comparison of different cortical connectivity estimators for high-resolution 
EEG recordings. Human Brain Mapping, 28:143-157, 2007. 

L. A. Baccala and K. Sameshima. Partial directed coherence: A new concept in neural 
structure determination. Biological Cybernetics, pages 463-474, 2001a. 

L. A. Baccala, D. Y. Takahashi, and K. Sameshima. Handbook of Time Series Analysis,
chapter Computer intensive testing for the influence between time-series, pages 411-435.
Wiley-VCH, 2006.

L. A. Baccala, D. Y. Takahashi, and K. Sameshima. Generalized partial directed coher- 
ence. In Cardiff Proceedings of the 2007 15th International Conference on Digital Signal 
Processing (DSP2007), pages 162-166, 2007. 

Luiz A. Baccala and Koichi Sameshima. Overcoming the limitations of correlation analysis 
for many simultaneously processed neural structures. Progress in Brain Research, 130 
(Advances in Neural Population Coding) :33-47, 2001b. 

L. A. Baccala, K. Sameshima, G. Ballester, A. C. Valle, and C. Timo-Iaria. Studying
the interaction between brain structures via directed coherence and Granger causality. 
Applied Signal Processing, 5:40-48, 1999. 

K. Blinowska, R. Kus, M. Kaminski, and J. Janiszewska. Transmission of brain activity 
during cognitive task. Brain Topography, 23(2):205-213, 2010. 

I. M. Gelfand and A. M. Yaglom. Calculation of amount of information about a random 
function contained in another such function. American Mathematical Society Translation 
Series, 2:3-52, 1959. 

J. F. Geweke. Measurement of linear dependence and feedback between multiple time 
series. Journal of the American Statistical Association, 77:304-313, 1982.

J. F. Geweke. Measures of conditional linear dependence and feedback between time series. 
Journal of the American Statistical Association, 79:907-915, 1984. 

C. W. J. Granger. Investigating causal relation by econometric models and cross-spectral 
methods. Econometrica, 37:424-438, 1969. 

E. J. Hannan. Multiple Time Series. John Wiley & Sons Inc.: New York, 1970. 

Y. Hosoya. The decomposition and measurement of the interdependency between second- 
order stationary processes. Probability Theory and Related Fields, 88:429-444, 1991. 

Y. Hosoya. Elimination of third-series effect and defining partial measures of causality. 
Journal of Time Series Analysis, 22:537-554, 2001. 

M. J. Kaminski and K. J. Blinowska. A new method of the description of the information
flow in the brain structures. Biological Cybernetics, 65:203-210, 1991. 






H. Lütkepohl. Introduction to Multiple Time Series Analysis. Springer-Verlag: Berlin,
1993. 

M. B. Priestley. Spectral Analysis and Time Series. Academic Press, London, 1981. 

Björn Schelter, Jens Timmer, and Michael Eichler. Assessing the strength of directed
influences among neural signals using renormalized partial directed coherence. Journal 
of Neuroscience Methods, pages 121-130, 2009. ISSN 0165-0270. 

Friedrich T. Sommer and Andrzej Wichert. Exploratory Analysis and Data Modeling in 
Functional Neuroimaging. MIT Press, Cambridge, 2003. 

Daniel Y. Takahashi. Medidas de Fluxo de Informação com Aplicação em Neurociência.
PhD thesis, Programa Interunidades de Pós-graduação em Bioinformática da
Universidade de São Paulo, 2009. K. Sameshima and L. A. Baccala (Advisors).

Daniel Y. Takahashi, Luiz A. Baccala, and Koichi Sameshima. Frequency domain con- 
nectivity: an information theoretic perspective. In "Proceedings of the 32nd Annual 
International Conference of the IEEE Engineering in Medicine and Biology Society, 
Buenos Aires", pages 1726-1729. IEEE Press, 2010. 






